Selecting Features for Paraphrasing Question Sentences

Authors

  • Noriko Tomuro
  • Steven L. Lytinen
Abstract

In this paper, we investigate several schemes for selecting features that are useful for automatically classifying questions by their question type. We represent questions as a set of features and compare the performance of the C5.0 machine learning algorithm using the different representations. Experimental results show a high accuracy rate in categorizing question types using a scheme based on NLP techniques, as compared to a scheme based on IR techniques. The ultimate goal of this research is to use question type classification to help identify whether or not two questions are paraphrases of each other. We hypothesize that the identification of features which help identify question type will be useful in the generation of question paraphrases as well.
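As a rough illustration of what representing a question as a set of features can look like, the sketch below contrasts two hypothetical schemes: one that keeps every non-stopword token (loosely IR-style) and one that keeps only words from a small hand-picked cue lexicon. The stopword list, the cue lexicon, and the function names are all assumptions made for this example; they are not the feature sets or the procedure used in the paper.

```python
import re

# Assumed, illustrative word lists; not taken from the paper.
STOPWORDS = {"the", "a", "an", "of", "is", "are", "do", "does", "i", "it", "to", "in", "can"}
CUE_WORDS = {"who", "what", "when", "where", "why", "how", "which", "many", "much", "long"}

def tokenize(question):
    """Lowercase the question and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", question.lower())

def all_word_features(question):
    """Loosely IR-style scheme: every non-stopword token becomes a feature."""
    return {t for t in tokenize(question) if t not in STOPWORDS}

def cue_word_features(question):
    """Selected-feature scheme: keep only tokens found in the cue lexicon."""
    return {t for t in tokenize(question) if t in CUE_WORDS}

q1 = "Where can I buy a used car?"
q2 = "Where is the nearest airport?"

for q in (q1, q2):
    print(q)
    print("  all words:", sorted(all_word_features(q)))
    print("  cue words:", sorted(cue_word_features(q)))
```

Under the cue-word scheme the two questions get the same feature set even though they share no content words, which is the kind of effect feature selection aims for when the target is question type rather than topic. Either representation could then be passed to a decision-tree learner for comparison.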





Publication date: 2001